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segmentation, object recognition or information retrieval. It is often an issue 
of difficulty for traditional clustering technique due to non-optimal result 
exist because of the presence of outliers and noise data. This review paper 


provides a review of single clustering methods that were applied in various 


domains. The aim is to see the potential suitable applications and aspect of 
Keywords: improvement of the methods. Three categories of single clustering methods 
were suggested, and it would be beneficial to the researcher to see the 
clustering aspects as well as to determine the requirement for clustering 
method for an employment based on the state of the art of the previous 
research findings. 
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1. INTRODUCTION 

Identification of homogeneous group is an exploratory cluster analysis has been researched since 
few decades ago [1-5]. Various applications used clustering methods as such of neural network [6-7], 
artificial intelligence [8-11] and statistics [12-16]. There are several clustering methods that have been 
enhanced to cater the issues of dataset features such as big data sets [17], high-dimensional [18] and 
distributed data sets [19].Clustering can be divided into two such as traditional technique includes hierarchy 
based, partition based, fuzzy based, distribution based, density based, graph based, grid based, fractal based 
and model based and modern techniques include kernel based, ensemble based, swarm intelligence based, 
quantum theory based, spectral graph theory based, affinity propagation based, distance and density based, 
spatial, data based, data stream based and large scale data based [20-21]. There are several issues of 
traditional clustering technique that arises in the study such as when the partition set of patterns to be 
clustered is large or the features reaches higher dimensions. Thus, the judgement would be difficult to justify, 
obtain and take time to execute [22-24]. 

Traditionally clustering techniques are broadly used are hierarchical, partitioning, grid based and 
density based clustering analysis [25-26]. Partitioning clustering such as k-means are useful to determine the 
number of clusters in the data. However, each pattern of mostly traditional clustering techniques which 
belongs to one cluster and provides a single view [27-28]. The importance view and less importance view of 
data clustering are treated equally, therefore, there is a lack to distinguish to compare. Traditional clustering 
can perform poorly when the non-optimal result exist in terms of between-cluster heterogeneity and highly 
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feature dimension due to the presence of multicollinearity among the variables or high skewness of 
the clustering variables [29]. This paper addresses the review of traditional single clustering methods. 
The remainder of this article is organized as follows: In Section 2, we present the approaches in methods for 
clustering in terms of single clustering solution. Section 3 is a discussion and a highlights drawback of single 
clustering methods and finally presents a conclusion. 


2. SINGLE CLUSTERING TECHNIQUES 

Many previous studies that used different clustering methods in a single and multiple solution for a 
given problem of classification, image segmentation and pattern recognition. Merging similar items 
and separating dissimilar items in different groups and only provides one single solution known as 
single clustering. This review divides the single clustering into three categories, namely hierarchical methods, 
partition method and other clustering methods. Hierarchical methods and partition methods are the most used 
in many domains as demonstrated in Figure 1. The following subsections discuss each category. 
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Figure 1. Single clustering methods 
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2.1. Hierarchical clustering methods 

Hierarchical clustering as a recursive partitioning perform clustering from various types of data sets. 
It can be classified into agglomerative and divisive. Agglomerative clustering method is normally known as 
bottom-up approach where each data point represents a single cluster at the beginning and these clusters 
will be merged based on the similarities and all points belong to the same cluster considering the most two 
similar cluster [30]. Meanwhile, divisive is a top-down approach is the inverse of agglomerative clustering. 
Initially, all points belong to the same cluster, which assigned all of the observations to single cluster and 
then partition the cluster into two least similar clusters and proceed on each cluster until there is one cluster 
for each observation [31-32]. Various research on hierarchical methods for clustering analysis were done 
include the application of Ward hierarchical to find subgroups regarding self-reported physical and mental 
health status of patients [33] and applied hierarchical cluster analysis to investigate the impact of parenting 
styles with adolescent self-esteem of middle school student in Korean culture. In addition, the used of cluster 
analysis to examine the effect of residents’ attitudes in the Balearic Islands of Spain regard tourism on 
community for five different opinion groups [34] meanwhile the investigated service quality in a 
metropolitan public bus service quality of Granada using cluster analysis and a Pearson correlation in order to 
discover the variables that give effect to the passengers’ about the service [35]. Those hierarchical methods 
provide solutions for clustering different types of datasets, however, it is still a lack of accuracy and require 
more datasets for a good clustering solution. 

For instance, hierarchical cluster analysis and expectation maximum (EM) algorithm were applied 
based on Gaussian mixture model (EMGM) in order to evaluate the mass spectrometry of the soluble species 
of Shengli lignite into meaningful groups [36] still need improvement in terms of results [12]. 
Another application that used hierarchical clustering is to obtain the personality of similarity on different 
group of individuals and the characteristics of cultural perceived the transport risks. In this case, the Ward 
method as a cluster analysis and principal component analysis were implemented in evaluating the quality of 
a tourist destination [15]. Additionally, cluster analysis was used in predicting the cluster of Mid-Atlantic 
wine market based on purchasing behavior, attitudes and social demographic attributes [16]. 
Various domains of problem performed by Principal Component Analysis (PCA) such as to upgrade the 
consumption data of electricity and the result showed that it can decrease the time of examining as well as 
increases the accuracy of clustering and applied in identifying control, heavy metal variability in soil and dust 
at children’s playgrounds and reflect potential sources in a different group of location [37]. 

A method of hierarchical cluster analysis with Euclidean distances and Ward’s aggregation 
method were performed in classifying the groups of consumers with different perceptions towards 
the different types of chocolate healthiness [38]. Hierarchical clustering is applied to identify the 
segmentation of tourist visitors of the chosen destinations on memorable tourism experience rates [39]. 
Correlation analysis, principal component analysis (PCA) and cluster analysis (CA) were used to investigate 
air pollution for the concentrations of nitrogen oxides, ozone, and particulate matter [40]. Ward methods as 
clustering technique was applied in defining consumer perception on food choice and quality in different 
selected Central European countries such as Poland, the Slovak republic and the Czech Republic [41]. 
Despite all findings from the mentioned above, all methods are feasible for the clustering results, however a 
lot of improvement when consider large data sets, numbers of variables and its variety. It is a challenge when 
to solve big data solutions. 


2.2. Partition clustering methods 

Partitioning clustering method occurs when each of the object must belong to exactly one group and 
each group must contain at least one object. Partitioning clustering method occurs when each of the object 
must belong to exactly one group and each group must contain at least one object [42]. A very popular 
K-means methods falls under this category. K-means clustering were used in many domain applications 
includes identifying patients’ perception of the different quality perspectives [43], to examine the clusters 
from self-care behaviors associated with sleep quality and mental health and to the cluster perception of 
illness patients as well as in a hybridation with other method [57]. The work of [2] compared the method of 
clustering such as Average Linkage, Complete Linkage, K-means clustering to examine the perception of 
illness and the findings revealed that K-means was given the better and appropriate result for illness research. 
In other research, the method of Rank-Order clustering algorithm was compared with the K-Means and 
Spectral in order to investigate the run-time complexity and cluster quality in terms of external and internal 
of face label and the findings revealed the performances of Rank-Order clustering algorithm is better than 
K-Means and Spectral [44]. K-Means and Spectral clustering were applied to calculate the distance between 
two clusters of ordinal survey data in a highly structured ranking, and the results indicated that the 
combination of K-Means and Spearman rank-order coefficient outperforms in the least of executions 
of time [45]. Physiological signals of human personalized stress used k-means and regression neural network 
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were evaluated where the finding indicates that the accuracy was improved compared to traditional technique 
without clustering [46]. K-Means cluster analysis was applied to interpret the sources, temporal and spatial 
trends of ultrafine particles (UFP). Latent Dirichlet Allocation and the VSM model with K-Means clustering 
were applied to calculate the similarity of text for a cluster analysis, then the conclusion shown that the 
accuracy of clustering algorithms is enhanced [47]. Euclidean Distance K-means method was used to 
evaluate the public perception of social and environmental impacts of hydropower projects [14]. 
Fuzzy clustering and K-means clustering techniques were compared to the creation of perfectionism profiles, 
then for the fuzzy clustering, the similarity depends on the quantity of cluster overlap [48]. 

Traditional K-Means with improved algorithm such as K-Protypes and improved K-Protypes 
algorithm in evaluating Intrusion Detection System and the results revealed that improved algorithm can 
achieve a higher detection rate and lower false alarm rate than the traditional K-Means [49]. 
Logistic regression and the K-Mean clustering technique were used in clusters alumni into segments in terms 
of the characteristics, lifestyles, types of behavior, and interests of alumni [50]. K-means clustering 
techniques was evaluated by examining the attitudes, opinions and interests in forest certification by 
segmentation of respondents of a landowner survey in Shandong, China [51]. 


2.3. Other single clustering methods 

This section will briefly elaborate the application of other single clustering methods in which are not 
categorized under both hierarchical and partition methods. As demonstrated in Figure 1, five methods include 
a two step cluster analysis, five factor model, optimized cluster storage method, Spatial “K’ luster analysis 
(SKATER), and Genetic-evolutionary Random Support Vector Machine Cluster will be focused. 
A two-step cluster analysis, which is profiled analysis was implemented in order to study the variation the 
characteristics in different student segments with gender as a categorical variable and task value of students’ 
perceptions, self-efficacy and attitudes on the usage and applications of computers [52]. The two step cluster 
analysis was applied to examine the differences of profile analysis in relative levels of self-determination and 
consequences of physical education motivation [53]. An optimized cluster storage method was proposed in 
order to cut the utilization and cost of data storage in the file for real-time big data in the Internet of Things, 
and the results showed the effectiveness and 70% of storage cost less than traditional methods meanwhile 
SKATER, fuzzy clustering and Gaussian mixture modelling for model-based clustering, and the result shows 
SKATER technique of individual perception effectively reflect the internal heterogeneities of citizens’ 
perceptions across multiple scales [54]. 

Genetic-evolutionary Random Support Vector Machine Cluster (GERSVMC) algorithm was 
proposed to evaluate the accuracy for Autism Spectrum Disorder (ASD) patients and the results indicates that 
GERSVMC is sufficient with 96.8% of accuracy [9]. Clustering with high dimensional Gaussian mixtures 
was proposed based on the expectation-maximization (EM) algorithm and a direct estimation method for the 
sparse discriminant vector, then the findings revealed that the proposed method shows better performance 
compared to conventional low-dimensional in terms of the optimal rate of convergence. Five Factor Model as 
a cluster model was applied based to find subtypes of personality traits [55]. 


3. DISCUSSION 
The traditional clustering techniques are broadly studied in few decades ago. It only considers one 
single way to partition data into groups. The aim of clustering is to identify group or clusters that are like 
each other than to other clusters. There are several advantages of single clustering which are simple as well 
as ease of handling and understanding of any types of similarity and straightforward. Eventhough traditional 
clustering technique have been useful, it is quite challenging to handle situations as sich when clustering 
processes in the population of overlapping or ambiguous. However, k-means work well when the shape of 
the clusters is hyper spherical. K-means also requires prior knowledge of clusters. The drawback of 
traditional cluster analysis as stated in the following: 
— Limited of space and time complexity 
Traditional clustering method are expensive in terms computational and storage requirements. 
The storage required for the hierarchical clustering technique is very high when the number of data 
points are high as need to store the similarity matrix in the random-access memory. The time is limited 
as there is a need to perform numbers iterations and, in each iteration, need to update the similarity 
matrix. Therefore, it seems not suitable for for large data. 
— Population overlap 
Generally, cluster made up of neighbouring elements, therefore, the elements within a cluster tend 
to be homogeneous. The situation where the elements appears simultaneously in more than one cluster 
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called overlapping cluster analysis. The redundancy of information occurs if the cluster 
analysis overlapped. Therefore, there was an inaccurate description of data if use traditional clustering. 

— Limited of information knowledge 
Most of traditional clustering cannot capture variety in various application due to interpretation of 
traditional clustering analysis was too strict since it can only interpret set of groups. For instance, 
grouping topic or region in a one cluster in a news and webpages [56]. 

— Low performance accuracy 
Most of essential parameters of conventional clustering are determined by the human user’s experience 
and cannot obtain balance between accuracy and efficiency of the clustering process [58]. 
The accuracy performance of traditional clustering becomes low since importance view and less 
important view of respondents are treated equally. For example, misinterpretation of the hierarchical 
analysis such as dendrogram. 


4. CONCLUSIONS 

Clustering is an unsupervised machine learning and can be used to improve the accuracy of 
supervised machine learning algorithms by clustering the data points into similar groups. From the reviews, 
many single clustering methods were applied in many real-world problems such as transport, finance, 
behavioral and health. It seems to offer a feasible solution, however, the accuracy is still low and require 
more computational resources. Some drawback of the methods is briefly discussed. One of the important 
aspects of clustering such as outliers and insufficient population needs to cater even though clustering is easy 
to implement in a various number of domains has to be considered for an improvement of the methods or a 
new method to be constructed. 
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